Kai Han

and Other Contributors

Scaling Parallel Sequence Models to Foundation-Scale Vision Encoders

May 30, 2026
Via arXiv

iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning

May 29, 2026
Via arXiv

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

May 18, 2026
Via arXiv

Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing

May 07, 2026
Via arXiv

Sculpt4D: Generating 4D Shapes via Sparse-Attention Diffusion Transformers

Apr 23, 2026
Via arXiv

Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs

Mar 16, 2026
Via arXiv

MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning

Mar 14, 2026
Via arXiv

Speed3R: Sparse Feed-forward 3D Reconstruction Models

Mar 09, 2026
Via arXiv

Surgical Post-Training: Cutting Errors, Keeping Knowledge

Mar 02, 2026
Via arXiv

DLLM Agent: See Farther, Run Faster

Feb 07, 2026
Via arXiv